CDS

Accession Number TCMCG078C07365
gbkey CDS
Protein Id KAG0460577.1
Location join(3367595..3367831,3367910..3367978,3368056..3368193,3368315..3368432,3368689..3368771,3369269..3369364,3369991..3370159,3370238..3370380,3370453..3370539,3370690..3370875,3371553..3371720,3371925..3372045,3372117..3372186,3372362..3372449,3372570..3372690,3373824..3374017,3374102..3374210,3374649..3374804,3379406..3379508,3379873..3380046,3380188..3380333,3421719..3421918)
Organism Vanilla planifolia
locus_tag HPP92_020874

Protein

Length 991aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000011.1
Definition hypothetical protein HPP92_020874 [Vanilla planifolia]
Locus_tag HPP92_020874

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyl hydrolase 31 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00028        [VIEW IN KEGG]
R00801        [VIEW IN KEGG]
R00802        [VIEW IN KEGG]
R06087        [VIEW IN KEGG]
R06088        [VIEW IN KEGG]
KEGG_rclass RC00028        [VIEW IN KEGG]
RC00049        [VIEW IN KEGG]
RC00077        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01187        [VIEW IN KEGG]
EC 3.2.1.20        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00052        [VIEW IN KEGG]
ko00500        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00052        [VIEW IN KEGG]
map00500        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGATGACGGGGTCGTGAACATGGAGAACAAAGCCGGTCGTATGGTATTCGAGCCTATCCTTGAGGAGGGGGTCTTTCGGTTCGATTGCTCGGGGACTGATCGCGCCGCCGCGTTTCCTAGCCTCTCCTTCGCCGATCCTAAGGTCAGGGAGACTCCTCTCGCTGTCCATGGAATGATCCCCGAGTTCGTCCCTGTCTTTGAGTGCGTTCATGGCCAGCAGAAGGTGCAGGTCCAGCTTCCTTTGGGGACATCTCTCTATGGAACTGGGGAAGTAAGCGGGCCGCTCGAGAGAACTGGAAAACGAATCTTTACATGGAACACGGATGCATGGGGTTATGGCCCCGGAACGACCTCCTTGTACCAGTCTCATCCTTGGGTTCTGGCTGTTTTTCCTGATGGGAAAAGCTTAGGTGTGCTTGCTGATACGACGAGGCGTTGTGAGATTGATCTCCGGGAGAATTCTACCATAAAGTTTGTATCTGCAGCTGTGTACCCTGTAATCACATTTGGCCCATTTGAGTCGCCTACTCGTGTCTTGATATCTTTGTCTCATGCAATAGGAACTGTTTTTATGCCTCCAAAATGGTCTCTTGGTTATCATCAATGCCGCTGGAGCTATGAGACTGATGCAAGAGTTCGTGAGGTGGCTACTAATTTTCTTGAAAGAGGCATACCTTGTGATGTTATATGGATGGACATTGACTACATGCATGGTTTCCGGTGCTTTACTTTTGATAAAGAGCGTTTTCCTGATCCGAAAGCTTTGGTGAATGACCTTCATGCCATGGGCATCAAAGCAATTTGGATGCTTGACCCTGGAATCAAACATGAGGAGGGTTATTTTGTTTATGAAAGTGGTTCCAAGCATAATTTATGGATCTTGAAGGAAGATGAGAATCTTTTTGTGGGGGATGTATGGCCAGGGCCTTGTGTGTTCCCAGATTTCACTAAGAAAGAAGCACGATTTTGGTGGGCTAATTTGGTAAAAGATTTTGTTTCTAATGGTGTTGATGGGATTTGGAATGATATGAATGAACCTGCTATTTTCAAAACGGTTACAAAAACGATGCCTGAAAGCAACATACACAGGGGAGATGCCGAACTTGGTGGTCGACAATCACACTCCCATTATCATAATGTATATGGCATGCTTATGGCAAGATCAACATATGAGGGAATGAAAATGGCTAATGAAGGAAAGCGTCCCTTTGTTCTCACTAGGGCTGGATTCATAGGAAGTCAGCGCTATGCTGCAACCTGGACCGGAGATAACTTGTCTAATTGGGAGCATCTGCATATGAGTGTGCCAATGGTTATTCAACTGGGTCTAAGTGGTCAGCCGTTATCAGGACCAGATATTGGTGGATTCGCTGGTAATGCAACTCCAAGGCTCTTTGGAAGATGGATGGGAGTGGGTGCCATGTTTCCATTTTGTCGTGGGCACTCTGAAGCTGGAACAATTGATCAGGAACCTTGGTCATTTGGAAAAGAGTGTGAAGAAATATGTCGATTGGCTATTTTAAGGCGGTCTAGGCTTATACCTCACATTTATACACTTTTCTATGAGGCCCATGCAAATGGAACTCCCATTATCTCGCCCACTTTTTTCGCTGATCCTAAGGACCAGAAATTGAGGAAAGTTGAAAATTCCTTTCTACTTGGATCACTTTTGGTTTGTGCAAGCACCATTCCTGAACGAGGATCACATGAATTATCCTTCACATTACCAGCTGGAACTTGGATGAGATTTGATTTTGATGATTCACATCCAGATTTGCCCATATTATTCTTGCAAGGAGGTTCAATACTTCCTGTGGGTCCTACTCTTCAGCATCTTGGTCAAGCTACTCGAACCGATGAGTTATCACTCTTTATAGCTTTAGACAAAAATGGTAAAGCTGAAGGAGTTTTGTTCGAGGATGATGGCGATGGTTATGGTTACACCCAGGGAGCCTATCTCTTGACCTACTATGCTGCAGCATTGAGCTCTTCTATTGTTACAGTGAGCATCTCCCGAACAGAAGGGTTGTGGAAGAGAGCCAATCGAAGTCTACATGTGCATGTCTTACTTGGTGGTGGAGCAATGGTAGAGGGTTGGGGAATTGATGGTGAAGAAGTGCAAATAACCATGCCTACAGAATCTGAGGTGTTTAACATGGCATCAGCAAGTGAAGCTCAACATAGGGAACGGATGGGTAAAGCTAAGCTTCTCCCAGATGCTGCTGCTATCTCTGGAAATAAGGGTTTTGAGCTATCCAAGACCCCTCTCGAGATCAAGGGTAGGGACTGGCTGCTTAAAGTGGTGCCATGGATTGGTGGTCGAATGATCTCCATGATACATCTTCCTTCAGCGACCCAGTGGCTTCACAGTAGGTTTGAAGCAGATGGATACGAAGAGTATAGCGGCATCGAATACAGATCTGCAGGATGCTCTGAAGAATATCAAGTTGTAGGGAGAAATCTCGAGCAGTCTGGGGAAGAAGAAGCTCTTACCCTAGAAGGAGATATTGGTGGTGGATTAGTGCTCCAACGCAGCATATTTATTCCTAAAGATGCTCCACAGATACTAGCGATATGTTCTCGCATAATAGCGCGAAATGTGGGTGCTGGCTCTGGTGGATTTTCAAGGATGGTTTGCTTGCGGGTGCACCCAACTTTTACCCTGTTGCATCCTGCCGAGGTGCTCGTTGTGTTCGACTCCATTGATGGCACAAAGCATGAGATCAGACCTGAAGCAGGAGAACAAACGTTGGAAGGAGATATCCTCCCTAATGGAGAATGGATGCTGGTTGACAAGTGCACGGGCCTGGGGCTTGTGAACAGATTTGATATCAACCAAGTGAACAAATGCATGATTCATTGGGGAAGTCGAACTGTTAATTTGGAGCTGTGGTCTGTAGAAAGGCCTGTTTCAGTGGAGACTCCCTTGGAGATTTCTCACGAATACGAGGTGAAGGAGGTGAACTTGTATTAG
Protein:  
MDDGVVNMENKAGRMVFEPILEEGVFRFDCSGTDRAAAFPSLSFADPKVRETPLAVHGMIPEFVPVFECVHGQQKVQVQLPLGTSLYGTGEVSGPLERTGKRIFTWNTDAWGYGPGTTSLYQSHPWVLAVFPDGKSLGVLADTTRRCEIDLRENSTIKFVSAAVYPVITFGPFESPTRVLISLSHAIGTVFMPPKWSLGYHQCRWSYETDARVREVATNFLERGIPCDVIWMDIDYMHGFRCFTFDKERFPDPKALVNDLHAMGIKAIWMLDPGIKHEEGYFVYESGSKHNLWILKEDENLFVGDVWPGPCVFPDFTKKEARFWWANLVKDFVSNGVDGIWNDMNEPAIFKTVTKTMPESNIHRGDAELGGRQSHSHYHNVYGMLMARSTYEGMKMANEGKRPFVLTRAGFIGSQRYAATWTGDNLSNWEHLHMSVPMVIQLGLSGQPLSGPDIGGFAGNATPRLFGRWMGVGAMFPFCRGHSEAGTIDQEPWSFGKECEEICRLAILRRSRLIPHIYTLFYEAHANGTPIISPTFFADPKDQKLRKVENSFLLGSLLVCASTIPERGSHELSFTLPAGTWMRFDFDDSHPDLPILFLQGGSILPVGPTLQHLGQATRTDELSLFIALDKNGKAEGVLFEDDGDGYGYTQGAYLLTYYAAALSSSIVTVSISRTEGLWKRANRSLHVHVLLGGGAMVEGWGIDGEEVQITMPTESEVFNMASASEAQHRERMGKAKLLPDAAAISGNKGFELSKTPLEIKGRDWLLKVVPWIGGRMISMIHLPSATQWLHSRFEADGYEEYSGIEYRSAGCSEEYQVVGRNLEQSGEEEALTLEGDIGGGLVLQRSIFIPKDAPQILAICSRIIARNVGAGSGGFSRMVCLRVHPTFTLLHPAEVLVVFDSIDGTKHEIRPEAGEQTLEGDILPNGEWMLVDKCTGLGLVNRFDINQVNKCMIHWGSRTVNLELWSVERPVSVETPLEISHEYEVKEVNLY